Isolated word recognition using a two-pass pattern recognition approach

نویسندگان

  • Lawrence R. Rabiner
  • Jay G. Wilpon
چکیده

recognition approach to isolated word recognition is that poor performance is generally achieved for word vocabularies with acousti-cally similar words. This poor performance is related to the pattern similarity (distance) algorithms that are generally used in which a global distance between the test pattern and each reference pattern is computed. Since acoustically similar words are, by definition, globally similar, it is difficult to reliably discriminate such words, and a high error rate is obtained. By modifying the pattern similarity algorithm so that the recognition decision is made in two passes, improvements in discriminability among similar words can be achieved. In particular, on the first pass the recognizer provides a Set of global distance scores which are used to decide a class (or a set of possible classes) in which the spoken word is estimated to belong. On the second pass a locally weighted distance is used to provide optimal separation among words in the chosen class (or classes) and the recognition decision is made on the basis of these local distance scores. For a highly complex vocabulary (letters of the alphabet, digits, and 3 command words) recognition improvements of from 3 to 7 percent were obtained using the two-pass recognition strategy. L Introduction The standard' pattern recognition approach to isolated word recognition is a 3-step method consisting of feature measurement, pattern similarity determination, and a decision rule for choosing recognition candidates. This pattern recognition model has been applied to a wide variety of word recognition systems with great success [1-31. However the simple, straightforward approach to word recognition runs into difficulties for complex vocabularies, i.e. vocabularies with phonetically similar words. For example, recognition of the vocabulary consisting of the letters of the alphabet would have problems with letters in the sets 4 In the above case the problems are due to the inherent acoustic similarity (overlap) between sets of words in the vocabulary. It should be clear that this type of problem is essentially unrelated to vocabulary size (except when we approach very large vocabularies), since a large vocabulary may contain no similar words (e.g. the Japanese cities list of Itakura [21), and a small vocabulary may contain many similar words (e.g., the letters of the alphabet). It is the purpose of this paper, to propose, discuss, and evaluate a modified approach to isolated word recognition in which a 2-pass method is used. The output of the first recognition pass …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

 In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

متن کامل

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

متن کامل

Prediction of dispersed mineralization zone in depth using frequency domain of surface geochemical data

Discrimination of the blind and dispersed mineralization deposits is a challenging problem in geochemical exploration. The frequency domain (FD) of the surface geochemical data can solve this important issue. This new exploratory information can be achieved using the interpretation of FD of geochemical data, which is impossible in spatial domain. In this research work, FD of the surface geochem...

متن کامل

Analysis of Isolated Word Recognition for Kannada Language using Pattern Recognition Approach

The speech recognition can be done with two approaches. In the first approach called as Isolated Word Recognition (IWR), the problem is to identify each word as an individual unit. The second approach is Continuous Speech Recognition (CSR), where speech must be broken into smaller units for identification. We have developed an Isolated Word Recognition (IWR) technique for identification of spok...

متن کامل

Two-pass Algorithm for Large Vocabulary Continuous Speech Recognition

This paper presents a two-pass algorithm for Extra Large (more than 1M words) Vocabulary COntinuous Speech recognition based on the Information Retrieval (ELVIRCOS). The principle of this approach is to decompose a recognition process into two passes where the first pass builds the word subset for the second pass recognition by using information retrieval procedure. Word graph composition for c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1981